Datamining protein structure databanks for crystallization patterns of proteins.
نویسندگان
چکیده
A study of 345 protein structures selected among 1,500 structures determined by nuclear magnetic resonance (NMR) methods, revealed useful correlations between crystallization properties and several parameters for the studied proteins. NMR methods of structure determination do not require the growth of protein crystals, and hence allow comparison of properties of proteins that have or have not been the subject of crystallographic approaches. One- and two-dimensional statistical analyses of the data confirmed a hypothesized relation between the size of the molecule and its crystallization potential. Furthermore, two-dimensional Bayesian analysis revealed a significant relationship between relative ratio of different secondary structures and the likelihood of success for crystallization trials. The most immediate result is an apparent correlation of crystallization potential with protein size. Further analysis of the data revealed a relationship between the unstructured fraction of proteins and the success of its crystallization. Utilization of Bayesian analysis on the latter correlation resulted in a prediction performance of about 64%, whereas a two-dimensional Bayesian analysis succeeded with a performance of about 75%.
منابع مشابه
Sequence-Based Protein Crystallization Propensity Prediction for Structural Genomics: Review and Comparative Analysis
Structural genomics (SG) is an international effort that aims at solving three-dimensional shapes of important biological macro-molecules with primary focus on proteins. One of the main bottlenecks in SG is the ability to produce diffraction quality crystals for X-ray crystallography based protein structure determination. SG pipelines allow for certain flexibility in target selection which moti...
متن کاملA series of PDB-related databanks for everyday needs
We present a series of databanks (http://swift.cmbi.ru.nl/gv/facilities/) that hold information that is computationally derived from Protein Data Bank (PDB) entries and that might augment macromolecular structure studies. These derived databanks run parallel to the PDB, i.e. they have one entry per PDB entry. Several of the well-established databanks such as HSSP, PDBREPORT and PDB_REDO have be...
متن کاملGrouping of bread wheat cultivars by seed storage proteins. Sonia Kahrizi1, Mohammad Sedghi2* and Omid Sofalian2
To determine seed storage protein banding patterns in some bread wheat cultivars and the similarity of banding patterns among different cultivars, an experiment based on seed storage protein electrophoresis (albumin and globulin) was performed. Water and salt soluble proteins were extracted in sixteen wheat cultivars using polyacrylamide gel electrophoresis and banding pattern was obtained. Stu...
متن کاملProtein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
متن کاملStudy of the Diversity in Different Cultivars of Pistacia vera L. Resistant to Drought and Salinity: Comparing Protein Patterns Using SDS-PAGE Method
Pistachio is one of the most important agricultural products that have always been associated with Iran, and its production has a long historical background in our country. In this research, protein patterns of 10 cultivars of Pistacia vera L. were compared in which cultivars grown in normal conditions where compared with cultivars grown in salinity and water shortage to determine diversity. Fo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Annals of the New York Academy of Sciences
دوره 980 شماره
صفحات -
تاریخ انتشار 2002